Improvement of a structured language model: arbori-context tree
نویسندگان
چکیده
In this paper we present an extention of a context tree for a structured language model (SLM), which we call an arbori-context tree. The state-of-the-art SLM predicts the next word from a xed partial tree of the history tree, such as two exposed heads, etc. An arbori-context tree allows us to select an optimum partial tree of a history tree for the next word prediction depending on the e ectiveness in the similar way that a context tree selects the length of the history (n of n-gram). The experiment we conducted showed that the test set perplexity of the SLM based on an arbori-context tree (79.98) was lower than that of the SLM with a xed history (101.56).
منابع مشابه
On the Development of a Model of Discipline-specific Reading Strategies in the Context of Iranian EFL Learners
Abstract Reading strategies are seen as supportive means to help learners process and comprehend English texts effectively. The present research probed to posit a discipline-specific model of reading strategies for Iranian TEFL postgraduate students. The motive behind developing a local model of reading strategy is twofold: first, a variety of postgraduate students admitted for M.A and Ph.D. pr...
متن کاملOn the Development of a Model of Discipline-specific Reading Strategies in the Context of Iranian EFL Learners
Abstract Reading strategies are seen as supportive means to help learners process and comprehend English texts effectively. The present research probed to posit a discipline-specific model of reading strategies for Iranian TEFL postgraduate students. The motive behind developing a local model of reading strategy is twofold: first, a variety of postgraduate students admitted for M.A and Ph.D. pr...
متن کاملStudying impressive parameters on the performance of Persian probabilistic context free grammar parser
In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...
متن کاملCode-Copying in the Balochi Language of Sistan
This empirical study deals with language contact phenomena in Sistan. Code-copying is viewed as a strategy of linguistic behavior when a dominated language acquires new elements in lexicon, phonology, morphology, syntax, pragmatic organization, etc., which can be interpreted as copies of a dominating language. In this framework Persian is regarded as the model code which provides elements for b...
متن کاملA Structured Language Model for Incremental Tree-to-String Translation
Tree-to-string systems have gained significant popularity thanks to their simplicity and efficiency by exploring the source syntax information, but they lack in the target syntax to guarantee the grammaticality of the output. Instead of using complex tree-to-tree models, we integrate a structured language model, a left-to-right shift-reduce parser in specific, into an incremental tree-to-string...
متن کامل